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[ V * * Perspectives on Panel Analysis t . 



Thou|h 0.cioiagis ts have 'employed parigl designs for wome time , there 



r 



is little agreement about the formulation and > estimation of panel models* 
In. part this lack of agreement^re fleets "the varied purposes that sociologists 
\ . ■ 'bring to. panel analyst*. Our motivations for choosing this design were 



» V, sketched In the previdus chapter , The purpose of this chapter is to 
outline the class of models and estimators we us e^ and our reasons for 

1 / - ■ - - . 0 '* ' J ...... 

using them. To place pur research in a broader 'methodolpgicpl context , 

I contrast our procedures w%th certain widely used alternatives, 

| As we discussed in the previous chapters our goal is to study" the 

effects of various institutional secloA on each "other, The literature, on 

national development suggest^ that effects may run in all directions, that 

is, that fejguback effects occur more often than not, ^ For simplicity- 1 

y . • , . * ■,.' > 

treat the case where there are only two variables of concern, X' and Y. - 

i ■ "*t * .« 

We might think of school enrollments and national production, or national 

production and economic dependence, etc. Suppose* further that the variables 

( jire measured at only two points in time, t and t . Then the operative 

fe 0 / • 



question is how to model, the causal relations among the four variables 5 

' Vv Y o' ;md V- ' • 'J ■ 

Suppose we had the ability to experiment. We would hold %L constant 

j; 1 - ■ • . , " ■ - 

(through either ^andomizatioiHor nonvariation) , introduce variation in Y 



and observe the consequences for -X > In another expe^^int 



would 



similarly vary X^ and observe -consequences for Y * In each case weS^ou,ld. 



examine the impacts of variation in one lagged variable on changes in th 
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bjthiii-Jwr table; The nenexperipenj|al model th^t We and most other social 

''.■'v'/.' w ; ■ .. ■ ! " '" H; ' ; ' ', 1 . y* ; ■ ,.■ 'J v •■• : • !>■ ■ ^ 

scientists us'er. Is a simple analogue' to this pair of experiments . We write 
a two equation system in ^jhteh variation in each outcpme variable at 
tiie ? -'t XA a linear function of its initial level and of the initial level^ 
of the ofcher variable; . , • : 0 . '. : .. = : . > • 

For shorthand I refer to the model in (1-2) as the basic model .' ' . , r 

Each equation states that the ou t e ome d e pend.s on the pair "of causally 
predetermined variables and on a disturbance that jwe assume is tin - 
correlated with ^he right hand side variables (but see below). Our, , - 
interest focuses on Qf^ and B^. These parameters indicate the degree 
i to which variations in one variable afieet * another % holding constant * 
initial levels. If = 0, we conclude that X does not affect Y over 
time; and if 0j " = ■' 0 , we conclude that Y does not affect X over time, 
. - But this is just .the usual structural equation or causal modeling . 

s " i • - ;: •„*■>'. .:; ?" • ' ; V , . \. .* ■ 

perspective. The pair of - equations, (1) and (2) Sf embody assumptions / 
both about the nature : of the process .and about the behavior of disturbing 
'•' factors (summarised in u and v) , 'Quality o £ in f exenc e sSBb ou t the causal, * 



relations obviously jdepend on the accuracy of both typei of assumption. 
There is no magic - in panel analysis. '-Nor is ^here a special thedry of 

statistical inference for panel analyses. Once ^he model is formalized 

- ■ ■ ■ '. ■ - , > . -f 

in a set of structural equations , the methods used are identical to 

those uied for structural equation models generally. 




frop that wh^ch informs Mc^-soeial 



sclent if ie jwhtk on ,f aa^l ^a'atysis * Iti is more fcomrori to p'reposi 



specialized Methods for ippel inference. Mostibf this work followr'the 



tradition ©fgun ^y^aza^^to *(:195„5* I\9 7 X ), [which poses ^the method 
;;• q log i ca 1 problem 



C 

Given that X* and) Y are correlated, construct 



Masu^es pf effect Wfifc permit one to ' % stone iude that X r causes &Y or Y causes 
X (but not both)* LaearsfaLd and Jhis stfudentS constructed indexes (for 



/V 



7 



sixteenfold tables) that jseej^ to answer this question. 1 1 This approach . 
has been general ized^ to^quantitative analysis in the form of cross-lagged 
correlation Xs^e Campbell an,d Stanley ,1963; Peli and Andrews 1964* Kenny 1973), 



V Undoubtedly tfrerfe^, are some situations in which 'X causes Y but not 



-vice versa. But, in general, "does X/cause Y or Y cause X? ,f is .not' the 
tight question. As we argued in tne previous chapfer, for many macro- 

„ ; . *, * ' * ' = | • *"**"*•"■. 

sociological questions it is reasonable to begin with the assumption 
, that X causes Y , and Y causes X. This issue is not merely rhetorical. 
Attempting toychopse either X or Y as the caus.al variable leads to 
testing procedures /that are far from, opt ijfoaT when both variables have 
effects* ^ Thus we eschew this approach to panel analysis, 
/■'■j. ■ So we work squarely 'in the structural- equation tradition* We presume 
familiarity {fit*h* the logic and procedures of this approach at « the level' 
of. Duncan ^1975) or Namboodiri, Blalock, and Carter (1975). r In several 
cases, we use more kdvanced methods. They are noted below ' and* d iscussed 



in the chapters in- which they arise, The remainder of this chapter is devoted 

■■ ■ • ■ r; " /■ * f 

. • ' • * - 

tq a consideration of alternative specifications of the ''relations amo'ng X^. , 

Y n , ^ft a ' and Y , and to various methodological issues that arise ia our 



1 

i 



work. The purpose 0Y t 



this discussion is to ftlace our work in a broader 
, , methodological context y to^note limitations on our findings, and to 
suggest ways in which future researchers can improve ,on our work. 

Why Not Use giaicr.-Sco'res? ' 4 

• ;' . '.' %r. ■' 

In discussing our work with other sociologists we are often asked 

Why we do riot relate changes in X to changes in ¥ arid vice versa. 

Addressing this question lea^s to consideration, of issues that clarify 

the interpretation of estimates of the parameters of fcheSnodel in (1-2), 

It will facilitate exposition to consider Separately use of change 



scores ^as dependent variables and as causal variables. To keep the 
algebra simple we will consider only the Y-equation. 

The first, model we consider relates changes ? in Y over the period of 
observation to the initial levels of X and a disturbance, w; 



AY t > Y t 



But the model in (3) is just . the, special case of (l) where a m i, g 0 

O 1 

two things can happen. The restriction that Of, ' ■ 1 may*be correct *in 
which case estimation of (1) will give%pproxya£t^ly tHfct result. Then 
it does not matter whether one estimates (1) or (3)r Oh "the other hand, 
the restriction may be incorrect. In that case estimation of the change/ 
score model .usually gives biased ssfimates of the Effect of X Q dn ¥ .""".To 



see 



or 



thii note that hy^iubtraqting Y n from each sid^ of (3) w^ obtain 



Xn " + a , Y n + <*oX rt + U - Y 



1 0 % 2"0 



• where q -^(a^-^i'Yy + u 



■ r -- > : — ■ ■■■■■ - - — — :-^r— >' .— 

Equation (4) h^s exactly the same form as tl$e change score model except 
that .the disturb Ace, q, , contains Y~, If X'and Y are causally related 

'over ^time, it will almost always be the case that they will be correlated 

. * >u : -. \ ■ ' v . W" ... ■ " 4 ' 

*\ cross^sectionally. If so, the disturbance, q, Is correlated with X ft and 

* least squares esftim^tes of (4) wl^l be biased. The bias, is ^ero only 

when theWorastrai^t is true,' i.e. ot^ » 1. The model in (1) avoids such 

bias,- . . t ■ '* ' ^. ' , 4" ■ . 

'What about changes in X over, the period of observation 7 ' *As long 

as the two- equation' model in (1-2^ is appropriate, both X and Y will 

. change over the s&udy period . Jn an experiment, we can study one process 

at a time ; ^ Lacking such controls , we cannot pre'sjpme that JX', . say, is ? 

fixed at X^ .over the period we observe. To do s6 leads to an obvious 

inconsistenc^when we shift attention to thjaf X equation/ That is, ifi 

equation (2) models change, in X j^r the period (t£ 9 tj, we cannot 

• . .. ■ ••' ' : 4 ' ■ , = * - 

maintain that X is constant in |he equat for Y . • / ■'■ 

• ' - ■ „ : ' , C ^ ' : ) - l / 

, ; From the perspective of "causal analysis th# re^l issue is Whether 

: '■■ • : .i **\ 

changes in- X over the study period, affect- Y fc (hold ihg constant Y Q and- X Q ) . 

■ -.' x 9 . = ■ . 4 = . .. ' . ; 

This important question i has far reaching implications as we^shail see ^ 

- 1 V " 1 ■ 1 ■ •* * 4 , v " 

below. A -^e can place the methodological problem In tlear^ fociSs by. 

: "# - ■ ".' ' 1 * \ a ' f ,i « ' 

wriiing- a model that incorporates the effect s^bf changes in each 



9 



r 



^ ' variable- 



■ for the. moment we presume that the m^del in Xi)-'> )« is tne. true' model, 

s . . , » . . r *' • ' » ' • V. 

> Note that the original Impd^l in tl-2) is a special case of this model 

/ i> ' w.ith ^ 3 - 0 and 6 3 0 , ; 
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t- , f 

S , 

L . • /"*.''' 



I*.. turns out to be more convenient "to rewrite the motfei as follows: 
" - • V t ^0 + ^% + r^ 0 ' + Y 3 X t +J V ' ' (7)' 

where r * * ^ fc * " "* 

This new model has an obvious* advantage over the*basic model*- it 
permits causal ef fee ts,4ver lag periods •shorter than (e Q , t). But it" 
intrWu^.-4 -new methaaologicai problem. /While the 'original model was . 
recursive-, the model in (7-8) contains simultaneous causation. Both X 
and Y t appear as* botir independent' and dependent variables in a pa^ir of 
aquations. So we have J.osfc^an important £ype of simplicity. More 4 
important, , the model in (7-8) is not identifiable (to use the language 
of econometrics). Its # parameters cannot be uniquely estimated from any 
data set. • ' \ 

V ■ '" ' t 

„ ■ The literature on simultaneous equations estimation ..commonly advises 

• two,.proc*dures Jr repairing unidentified models ( see ' Johnston 19 
,for example), One can use. theory or prior .research to plade constraints 
on ; the parameters of the model. Unfortunately we cannot avail our- 
selves of this strategy. as we lack such theory and evidence / v The second 
alternative invoives^the use ofi iristrumental variables. I* The model' in 
£7 r 8) can be identified by finding a pair, of variables ^h^are uncorrelat 
yith u 1 and v 1 such" that one of' them affects Y but^not X i directly 

anA the other affects *X £ but not ^ , ' Inserting these variables, into 

\ \ ■' ) I t 

estimators, wilr" resolve the indeterminacy of the model. But again we y 

t f ^ d ourselves blocked by the lack of prior theory As the literature 



' ' — : — • — ~ tor* — — • — 



« r . f.. ■ - .,.7.; .» 

r argues that almost everything affects everything e^se, it is har^ to w 



V 



argue aii thoi^tat ive ly a, priori that some variables behave as instrument^ 

.(that Is th'ey affect only so^^f jthe variabl/s in the model), ■ | > 

"/ ^ I \ .' * t 

In the long run, the resolution orfthe problem under discussion 

• t * V J 

* Mil likely involve these sorts of strategies. Our assessment was that 

* ' ' > • • - " • " . . * -\ 

the field was not - sufficiently well developed for u*s to rely on theory 

and p^or re'Search for the strong assumptions tt^at must ^e used to 
- motivate the simultaneous equations methods, Thus wa decided to settle 

.4 ■ L ^ . : ' ; . > 

\for s imp ler method s . . * . * , • 

) - t . * ' 

4- r Our situation is as follows. We propose to^use the model in (1-2) 

to draw inferences about institutional ana 1 structural change in nations. 
However j we realize that this model is not completely general and that 
strong arguments can be made for 'the superiority of the model in (7-8) , & 
So we must defen^l the choice* of the basic model over* J: he mor&^ complex 
alternative*^ I just argued^that the more complex mode^ cannot be 
^uniquely estimated as it stands, w But that is pot- A justification for 

the simpler model, pAjre there any general* arguments that favor the model 

^ ' " V " * - " ■ 

\ in (1-2)7 1 will advance three, , .* 

One line ot^argumen , ^^iaims'¥that change s^^uring the study period do 

not have effects, that ttae .batic model is correct, yhis amounts to 
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arguing that all the causal effects have a l&g at least as. long as twenty years 
But , it is seasy to show that i*f the lags are longer than twenty years we will 
.not observe systematic effects with jthis ^nodel* Therefore* this position 
requires that one'know exactly^he structure of the lags in the e"f fects* 
This sort of reasoning, motivated Heise 1 s (1969). treatment of - panel analysis, 

- * * / ' % 

) v < ' J * ■ 

^ f ^ N 



This position has^some appeal s Soma social scientists argue 

■ * v. 

that twenty years Is too ^short a period ovet, which to observe Inter- ^ 

Institutional effects,/ Readers who take this view will' not be troubled 

by. our neglect of .effects with lags shorter^ttian twenty years, However, 

we observe that some institutions, natably^politieal structures / change 

literally over night, Such changes can have strong and almost immediate 

effects on other institutions 1 ^ So we cannot always rest assured that 

twenty years, is a conservatively short period that rules out effects of 

- shorter lag, s •" ^ * 

■We find another aspect of this position disturbing. It presumes that 

process operates over discrete time intervals. In formal terms vf 

the model implied is a difference equation with a twenty year lag 1 

prefer fcjie view that adjustments to institutional changes ara continuous 

in time and hot specialized to any particular causal lag. What happens 

i * . ... . 

if we "model the process in this fashions that is as a continuous time 

process? Our. general argument presumes that changM in X ahd ¥ depend 

on the levels of both variables.' The simplest continuous * time model 

that is consistent with this argument is the linear system: 

Td? 1 m a 0 * a i Y(t) + a 2 x(t) '■■ > >. < 9 > 4 

^ L - b Q + bjXCt) + b 2 Y(t) . (10)' 

But the differential equations in (9-10) cannot be estimated directly. 
Instead we 'solve the system subject to appropriate initial conditions i? 
(see. Coleman 1968),. The solution to this system has exactly the same »' 



i 



9 • J 

,10 




form as our, bake modal, (i-2). That is, it is a two-equation model in 
which X^S^fwrfffs linearly on X^/ and t Q , and Y fc depends lintarly on Y 

and. Kq, The parameters of the panel model can thus be considered as 
« noti linear f une t long of the; parameters df the ^differential equation model 

(see Coleman 1968; Dorian and Hannan 1976; Hannan and Freeman 1978),, . 

This relationship may surprise* readers trained in classical panel 
analysis tradition^ It shows that the basic model is less restrictive, 
than conventional treatments icjly. Despite the fact that it ignores 
effects ,of AX^and ^ , it turns out to be an^implication of a model in* 
which X and Y adjust continuously to levels of X and Y during the entires * 
period. For details/ see Kaufman (1976), ' V 1 

■ • ' ' * X 3 ' ' • 

The linear system in (9-10) can be shown to -be a reasonably good 

approximation to a variety of more complex change models. This seems* a good 
argument in favor of the model in (1-p, However s _we cannot assert that 
the basic ntbdel is the correct model for panel analysis* Other more 
complex differential equation models will give rise to different panel 

-v" ■ .:. . ■, 

specifications* Nonetheless ? it is comforting to learn that %he basic 
mode^ has more general value than is commonly acknowledged in discussions 
of panel analysis, 

Both these arguments have a "best case" flavor, That 
is ? they depend on nature/ s working in» the interests of valid inference , 
But, .what 'happens if the more complex model is correct? * More concrete iy 



what are the consequences of estimating the basic modjg^ when AX and AY 

have causal effects? If the qualitative' implications from the basic model 

* ■ . *■ 

hold over plausible ranges of the parameters of the model with- change effects 

h f 

(7-8), this argues strongly 'in favor of the basic model. »If , on the other hand 



results from the basic m&del change -radically with alterations In 

e ■■■■ 

parameters of (7 - 8} f we spit proceed cautiously. 

In particular,' we must identify those ranges of parameter values over- 
which inferences from the basic model have some validity and restrict 
our attention to empirical .contexts that meet the conditions,^ 

The first- two arguments treat the basic model as a structural model 
in its own right, ; But in the present discussion, it must be considered 



as the reduced-form of the model in (7-8). That is. we reconeeptualize . 
the basic modal as an algebraic rearrangement of (7-8). It is obtained 

..." • \ * 0 ... • " 

by solving (7-8) for Y and : • 

SY fc - C- l [( Y{) -Mf^.J + (Y 1 ■ + VpY 0 +<V',+ Y 3 6^X 0 .> (Y3V +„')]: (U) 

X ,. " C_1 [( 6 0 + W + (6 1 + 6 3 Y 2 )X 0 + fl + 6 3 Y 1 )Y 0 + < 6 3 u ' +v '0 (12 ? 



7 V 



Where • ' 

c - i - v & • /, 

Equations (11) and (12) .provide a new interpretation of the coefficients 

g£ the basic model in terms o| the parameters * of the change model, 

* ■"•.'-*• i 

Recall that the change effects model is ; hqt^ identified. Thus one 

cannot use reduced -form coefficients to obtain unique estimates of the 

$ - 
or's and P f s, But as' long as u' and v 1 are >well- p behaved^ (i,e/, uncorrela'ted 

"\ * . . 

with X Q and Y ) can always obtain good estimates^ of reduced torm co- 

efficients. That is why ye investigate the usefulness of the reduced-form 

for informing us about causa! re lat io^**no Id ing * under the more complex 4 

model,- > . 

In using the basic model, we* report estimates of as the 



effect of Xq on Y £ and ^ as the effect of Y^ on X £ . We confine/ at - 

tent ion y to the former as treatment. of the two cases is completely parallel 

According to (11), - * . , ■ , - 

Thus we investigate whether Ine quantity in (13) is a reasonable measure 
of the effect of X on Y over time, ' ) 

i 

.'i "* 

The quantity in parentheses in (13) is familiar to structural 
equation analysts. It is the sum of a direct effect and an indirect 
^fect . The direct effect, is the so-called cross-lag effect. The 

indirect effect*, V 3 6 1 ^ is the effect of X Q on via X fc , Thus the 
reduced-form effects summarize both direct and indirect effects, The 
total effect of X Q (the sum of the direct and indirect effect) is 
multiplied by E" 1 . This multiplier adjusts the total effect for the cycle 
of causation implied by the model. According to the model, any increment 
in X_ t produces a change in Y fc , which in turn affects X £ , etc. So effects 
of X Q are propagated over an infinite cycle of causal loops joining X fc 
and Y . The multiplier is the appropriate rescaling of the effects of X Q - 
Thus to understand the reduced-form coefficients we must consider both ■ 
the total effects and the multiplier. 

Since the mode L is under ident if led , we cannot conduct an exhaustive 
analysis. The reader can always choose some combination of Y's and 5's 
that will cause trouble. However, it is informative to consider a series 
of special cases . 



12 

gasa It X has no effect Qn Y £ : ^ = 0, ^ = 0. Clearly a r} = Q 
for this case, Jhis result, though obvious, is important . It tells us 
that when*X has no effect, we will not mistakenly identify effects of Y or 
4 Y' on X t as effects of X on Y 

Case 2: X Q affects Y fc but AX £ does not: Y 2 >0 ? Y 3 ^ 0. Again the 

result is simple; ^ = Y 2 , the cross-lag effect of X Q in the structural . 

form. This holls even when &Y has an effect on X fc . 

Now we consider more troublesome cases. Throughout we assume then 

the autoregression, 5 , is positive, since this is the case m our empirical work* With- 
es 

out loss of generality, we assume that the effect of , Y^ is positive ns 

= 1 ' ■ ' 

well. In the general case, the multiplier, C , can take on any value, Until 

further notice we assume that C * is also positive, 

Case 3: X Q and &X £ have the same sign effects : Y 2 > 0 , Y 3 >0, 

In this case the reduced-form contains both a direct and an indirect effect. 
Since for the moment we assume a positive multiplier, the sign of the 
reduced-form effect is: 

Y 2 + Vl = Y 2 + V 5 1 " L)i (l4) 
If both Y 2 > 0 and > 0 , the quantity in (14) is positive when 

: \> X -X. ' <i5) ' • • 

3 ■ , . 

We will consider three cases. When the lagged effects and change effects 

are equal, i.e., Y^ - Y^, the requirements for the quantity in (15) to be 

positive is 6 ^ Q. This amounts to requiring negative feedback in the 

process generating the X . I mentioned earlier that we do not find, this * 

* *». 

problematic in our searciu 
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Similarly, when Y^ > ^j*-- t*i e cri-terion is that 6^ exceed some 
negative quantity, (1 - K) where K is the ratio of Y^ to "Y •- So 
neither is this case problematic . 

Finally, there is the case in ^hkh the change term effect exceeds 
the lagged effect; ^3^^2" ^ e criterion for the reduced=form effect 
to be *pus it ive* s that 6^ exceed 1 - K, it now potentially problematic, 
Since Y^ :> Y^ , 0 5 K £ _L Thus 6^ must exceed' some positive quantity, 

i 1 ^ 

For example, if Y^ is twice Y^ 3 then we require that 6 > ip So ? we will 
run into trouble when Y^ is much larger than Y^ and the autpregress ion 
of X is small. Such cases do not seem likely in our research as the 
autoregressions are rarely sntalj.. Nonetheless! we must recognize that 
the effect of X^ in the reduced form will have opposite sign from the 
affects of Sq and AX in the structural form (but see below) , 

So in all but exceptional circumstances, we expect that the sign of 
agree with those of the relevant structural parameters. However, quan- 
titative estimates of the effect of Xq will be wrong for two reasons. 
Using the reduced-form gives X^ .credjLt for the causal effects of 
This is a mistake as far as inferences about Xq are concerned/ 
But it is not wrong regarding the effect of X considered globally. 
We attach no special significance to the 1950 levels of variables, We 
have chosen the time points out of design convenience, So it does not 

upset me in this stage of inquiry to confuse effects of 4X with those 

4 t 

of Xq . The second reason why the quantitative estimates of differ 

from the structural parameters involves the multiplier. We delay discussion' 
of this issue. 
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Case 4; The system is close to equilibrium: X Q and iX ( haVe 



the same effect as do Y () and ±Y': V,, = Y 3 ; 6 2 = 6- When systems are 
close to equilibrium, the. effects of ( sraa 1 1 ) changes are close to the 
effects .of initial levels. In .this special case, , , ' 

i 

ff l " 1 " \ 

*- 1 

q m y 6 C r 

2 1 2 1* 

where TC ? - I - Y~Q~ " "1 -- Y 6 . Notice that 

V 2 = ^ 2 /P p and- 6 2 - $ 2 /a v * 
If, as is of t^n the case in our research, the reduced-form autoregressions 

and are close to unity, the reduced-form cross-effects are close 
to the structural-form cross-effects, . 

Case 5: X ^ and X. have opposite sign effects; Y 2 > 0 , Y^O. Here, 

— — =^ — ; 5 , = = i ; ~ 

we find real problems , as well we should, If lagged effects are positive 
and change effects are negative u any attempt to come up with an overall 
effect of X will mislead. The relevant expression is again (15), The autoregire ssion 
coefficient in the X equation, 6 again plays a central role in 
determining the sign of the reduced-form effect. If 6 < 1, then (15) is pos- 
itive and the reduced-form effect has the same sign as the structural 
effect of X^, When 6 = 1, the situation is even better; it is the same 
as Case 2. However, when .6^ exceeds unity, expression (15) is the sum of 
a positive and a negative quantity and (15) can be either positive or 
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negative-depending on the relative size gf Y and "(hpiding 6 - constant ) . 

. < ' - # ' ^ " 3 ( - . 'V- , ' • 

The qualitative result is similar to that in Case 3. Wjien Y« is much 

v . . ,-*ras5- J :... 

larger than Y 2 > we .can obtain the wrong- sign effect (here, we alsonwid 

. ■ . - f 

/ To this point we have seen that inferences from the basic model 

..0; -\. . . • 

have at least the proper sign as long as the effect of AX is ^not much 
... larger than the effect of X Q . This cone lusion ;; is reinforced when we 
focus on the multiplier. .Recall that C _1 - l/[r - Y^lr Thus it is 
positive when Y^ is less than' unity, if Y 3 & 3 > 1, the system will grow * 
.explosively. Any exogenous impact on X say, will be amplified through 

each cycle of X^ - Y £ - X^, The system is unstable and cannot long 

•V;; " ' ~~ ~ • ... ' , • 

maintain this causal structure . 

This is not' to say chat it can never happen in nature, Many of the 

structural properties we study grow exponentially during the period. 

However, in many analyses TT e adopt log transforms to linearize such 

growth, (See substantive chapters for more details, especially ,Ch. 6.) In 

light of such transformations, it does not seem likely that Y^ exceeds 

unity for the cases we consider. Nonetheless, we recognize Chat violations 

of this condition wreak havdc with inferences from the reduced-form. It 

is difficult to establish precisely the implications of explosive growth 

for reduced-form estimates since it is not obvious that <Sther parameters, '" 

e.g. , 6^, would remain unchanged under large increases in Y^ and 6 , All 

I can do is caution the reader that the value of even the qualitative 

inferences from our basic model may be incorrect if this condition is 

not met . 
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Thts third perspective on the basic model , then, leads to more 
% ■ ■ • . V 

Cautious conclusions. In general, it tells 'that we will not ko far 

wrong' when t/he effect:; of*&X t and Aif t are small »lat£ve to those of.X () 

and Y 0 and t^fgfowth is stable (In the transformed variables). 

: So the class of model's wo use can be justified from my of three per- 

spectives: (l^that- the basic model in (1-2) approximates the true causal 

stmecure,. operating with a 'certain lag; ^2) that (1-2) is the solution of 

a differential equation system relating levels' of 'x and Y to changls in X 

and^Y; or (3) that the basic model is the reduced- i^rm of the proper model 

which allows for causal effects over the study period, A At the 'time we began 

our research, the sociological literature emphasized the f rrst of these per- 

spectives. However, none of us believed that the lag periods weused cor- 

* r .' - * 

respond with any fundamental feature of the processes understudy. Thus we 

are forced to either of the^ o propositions * I favor the second; other mem- 

bers of our research group incline more towards the third. We agree , however 

that empirical analysies pKthe 5ort we report are necessary preliminaries to 

\ - 
adequate specification of the dynamics of national development, 

Issue s___i_n' E s t imat i on 

Our research context poses some special methodological issues. 

These issues hold with equal force when the basic modePis viewed as a 

- »\ 
structural model or whether it is 'considered a reduced-form for a more 

j - • * 

complex model. In this section I will identify the issues , outline 
our strategies for addressing them', and cite relevant technical . * ' 
literatures that, contain more detailed treatments, ,. - 

So far I have assumed that the disturbances 

i % 

(the total effect of all omitted variables) were well behaved. In 

particular I assumed that they were independent of the variables appearing 

J 

on the right ? hand, s ide of the basic model, If this is so, ordinary 

* 

• ■ 1 18 ■ 



ieast squares (GL8) estimators ate consistent (see Johnston 1972: Ch, ft)t t 

- V * ( v ., . 1 

If, in addition A the disturbances have constant variinee, OLS also 
asymptotically >fVicient , _ The /"first two complications we eons ider involve. 



iepenue-rice and cc 
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likely failures af the independence and constant variance assumptions* 

■•: - < • v * ■ 17'. • 1 

a. Autocorrelation r rea^efrs, Me undoubtedly sensitive to' the ?[ 

possibility, thalj: 'the disturbances are not independent from period to 
periqdU The problem is endemic to pane 1^ studies , and the studies we 
reporr\are no exception,. Consider the 'composition of the. d isturbances . 

In our research, the set of omitted variables includes: material infra- y. 3 

, - * i 

i ... • * 

structure (e,g,, national systems of , transport^ communication, etc,), ' «■ 

technology, cultural organ izat ion, national history, etc. Each of these 

features of social organization has t&o^propefcties (1) they are stable, that £ 

they do not A-irf radically from period to, period ; and (2) they affect \ 

' ' \ ' ' 

many ,of the processes we study, As long as I these causal effects are not 

included in the models they are forced into the disturbance terms, And, given 

of 9 * 6 t 

the pair/properties identified above, the disturbances will be correlated 

from period to period, " Nations *with unobserved characteristics that 

generate exceptionally high levels of some outcome in one period will teno 1 

to have exceptionally high levels in the next period, The more enduring ^ * 

arc the unobserved causal forces , the stronger wit! be the autocorrelation 

of the, d isturbances , the dependence from period to period. 

The impact of autocorrelation of disturbances on OLS estimators is 

well known (s«e, for example, Johnston 1972: Chs . 8, 40; Hannan and 

1 Young 1977), We must consider two effects. First, autocorrelation 

implies some no n- independence of observations; the analyst using such 1 

data has less information about £he process' than would: be given by the . 

same number of independem^observat ioivs , But, OLS assumes that the 

observat ions /a«^indC]5Cndent and calculations of confidence, intervals 

and tests of significance reflect this assumption* Thus use of OLS with 
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y correlated regressors, The 
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aut^correlated disturbances biases .standard errors towards zero and thus 
gives inflated levels .of Statistical significance! * ; 

■ . r * 

The second effect applies particularly to models, like the basic 

* model , that include lagged dependent variables. It is easy to show (by 

^# writing .^he basic model for the period fronil-t to kg and making appropriate 

substitutions into the basic model) that autocorrelation of disturbances 

implies that the disturbances at time, 0 will be correlated with both X 

and Yq - In this case OLS estimators are ,b lasted anxi inconsistent . 

We are concerned wifh the properties of OpS estimators in* small 

relevant . , j * # ' / • - * 

samples. We have some/ evidence from Monte^Carlo simulations. In 

regressions Sf a variable On only the lagged dependent variable, -auto- 

If w 

= 4 * " f 

^correlation (of the type termed "firfet-order autoregressive") produces * - 

* i 
an upward bias in^the "slope associated with the lagged dependent variable . ? . 

(Malinvaudj 1961), Addition of another regressor reduces this upw'ard bias. 

Nerlove (1971) and Harlan and Young (1977) obtained similar- results * 

• - 'S 1 "' • ' ' 

using a different error structure (a variance components models with random 
but constant individual-specific effects, see below), Ttie slopq of the- \ 
lagged dependent variable is biased upward while the, slope of an exogenous 
variable is biased downward, the lattet effect reflects • the negative 
correlation of estimators for posit ivej/ 

autocorrelation biases pushes the sloped of the lagged dependent variable 

upwards and thereby pushes, the slope of a correlate^ regressor downwards, 

The results of the simulation studies do not apply directly to ou c r, 

model. Here both X and Y^ are^ correlated with the disturbance in each 

.equation while a in the simulation studies the additional regressor was 
i * 



• not torrelated with, the- di|)urbanc4.. Non^thele^V,, tfe' results see. 
^hoid' approximately, %Ke Equation fo^&amfce. '/fhe/end^rlng p 
, o^ *he.'d-tsttybance < affe&i both Y Q and Yj_ directly bit X ohlyS ^ 



port i on 



* ihdifje'ctly tHrough'effects on Y_ t and effect of Y_. on-Xg. These indirect 
effects will usual ly much smaller than the dirtfct, effects, 'consequently 
the correlation of w^th the disturbance should be conside.rably'highe^ 
than that of X Q with the- . same disturbance , Then the coefficient of Y 



will be more .affected by autocorrelation bias,_ ic is no\ clear what will 
be the efface of autocorrelation^n the slope of X Q . Thirdepends pn 
thejfiegnitudes of the various effects and the strength of autocorrelation.' ; 
I cit^ot assert that the. sloped of X Q will be biased towards zero as N / 
in the simulation studies.' Nonetheless it seims highly unlikely that t>,e 
bias in this slope would be positive. In the cases we analyze I expfecT: 
this bias bo be' -relatively small and unsystematic . 

All but two of our empirical studies use OLS estimators. The 
" results. just cited imply that their estimates of slopes of the lagged 
dependent variables will be biased up'wards. The estimates of the cross- 
.effects on which we base our inferences may be biased\n complex ways,. 
But. it seemsTmUkely that any such biases will systematically distort our 
inferences across the multiple analyses that we report. ■ ' ' 

In two cases, Chapters 4 and /, we correct for the most pervasive 
form of. autocorrelation bias. In Chapter 4 we pool seleral waves of 
panel observations and estimate models that include disturbance terms 
that are specific each nation. The nation-specific, factors ' summarize 
= ,the most enduring features of social organization and are assumed to be 
constant over the study period. The model should also control for unobserved 
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lR causal factors f that. change slowly relative &o the processes under study. 
U&e of af pooled model enable us/to identify the "causal structure and to 
focn generalized least squares estimators of causal effects. . These estimators 
have been shown to have good small sample properties (Hannan and .Young, 1977) 
Chapter 7 alsa_.,treats the lac tors 'producing autocorrelation as un- 



observed latent variables, ( Use of multiple indicators permits identifiea- 
tion- of a mod^l wit}^ latent variables producing autocorrelation for each 
indicator p£ the dependent variable. The mo^ei . is. estimated by maximum 
Uk^ljUipod using Jbreskog's (1969, ^73) procedures for analyzing .linear „ 
structural equation models .j ' * ,\ 

b, .Hete r osceasticity : 'The assumption of constant error variance 
is also problematic for some of the outcomes we ^study. Some of our 
outcomes, measure the scale of a system, e.g., CNlV school enrollments, 
It is unlikely that Guyana, say, has the game error variance in either 
outcome as the Soviet Union or the U.S, A small percentage deviation for 
the latter is larger than the initial level for the former. In general^ 
period to period fluctuations for giant nations will be much larger than 
those for small nations, In other words, we expect that the error 

variance for such scale -variables to be increasing functions of the scale, 

■J* 

^ • Suppose the error variance of u £ in equation (1) is proportional to, 
say, the square of X Q , i,e, 

' • ' Var (u t ) = SX ttf -2 " ? 

Then, it is straightforward to correct for heteroscedasticlty by forming 

weighted least squares estimators (WLS ) . The procedure is as follows, 

<* • 
Divide the entire structural equation by X Q (this transforms the 
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= disturbance to u /X* (which' has variance constant across values 

*of X^. Apply OLS to the transformed equation but treat the coefficient 

'of X^ 1 as the constant u the^pnstant as the coefficient of X Q , and the 

K . ' ~ 

coefficient, of Y A /X rt as the coefficient of In other words ; we continue 

,00 0 

. ■ - c 

to fgcus on the original structural equation and introduce - the trans- 
formation only to repair a problem in the distribution of the disturbances. 

This*- procedure has ^a superficial similarity to use of structural * 
equation models 'for ratios of components to scale factors, e.g., 

A ■ ' 

enrollment ^ratios 5 ^pNF per capita. Much published research and some of 
our research deals with relations among' such patios . Alternatively we 
treat the relevant scale * factors , e.g. population size, as causal variables 
in their own right. Using the first strategy we might regress per 
capita GNP on lagged per capita GNP and some political variable. In. the 

second we would regress GNP on lagged GNP* the political variable ^ and 

V* 4 < 

■ ■*'•»*,. 

population size (perhaps at both time 0 and time t ) . Since, the latter 

v ...... - . < ■ 

strategy is used much less widely than the formed, the matter deserves some 

it* - 



Ideally our theories would arbitrate between these formuations. Since 
our theories lack sugh precision, we must use other criteria for choosing * 
between them. Use of ratio variables Is commonly defended as a method for 
adjusting or controlling for scale, But the second formulation does 
this' as well, though the nature o % f' the adjustment differs. Are t^iere any 
.other grounds for -choosing between the two? Some sociologists have 
recently argued thajfc the use of ratio variables may complicate inference 
(Freeman and Kronenfeld 1973; Schuessler 1973; Fugguit and Lieberson 1974), 
They have noted particularly the difficulty that arises when one regresses, 



Capita) . 



22 

say, the ratio k>£ expatriated* profits to GNP on .GNP (or Gfcr 1 per 4apita)\ 
SinCf the same variable, GNP, appears in the numerator on one side off . J 



the equation ^and as a denominator on^ the other, the relationship between 

the ratios is constrained, So*, for example', a hypothes Ls^that two 

variables be negatively related, does not face much likelihood of being 

rejected j , • 

** 

A more general problem has not been addressed in the sociological 

■ " . 1 

literature, Consider- a model in which 1970 GNP/Pop depends on 1950 GNP/Fop and 195( 

, relevant age group), 

primary enrollment ratio (number of primary students divided by size of the/ 

The coefficient for the cross-effect of the enrollment ratio on per capita 

•GNP summarizes a number of possible effects: the effect ot size of 1950*' 

enrolled population on 1970 GNP, a (nonlinear), effect "of 1950 enrollments 

on ^970 population, a (nonlinear) effect of 1950 school aged population on * 

1970 total population and on 1950 GNP, I would not attribute the same 

substantive significance to each of these effects, Therefore, I am 

unwilling to summarize all these effects in a single term, I prefer to model 

the process in terms of primitive variables for this reason, 

My argument hinges on my substantive orientation. Other members of 
our research group are more comfortable formulating arguments in terms 
of ratio variables. So we do not speak with one voice on this issue, 

c Measurement Error - Much of the work of our research project 
concerned measurement. We were constrained by the availability of data 
gathered by international organizations and , in several instances , by 
research groups. However, we devo ted cons iderable e f f or t to comparing 
estimates of the same quantity from numerous sources, checking che p " 
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internal consistency of various indices (both over time on the same V 
index and adross indices qf the same construct at a point in time ) 0 etc. 
Ar a result of this work. we believe that most of the data we use are of 
high quality/ 1 " ' : 

However, the sources of error in reports, of population size,' GNP ? and 
school enrollments are well documented. Although we. have eliminated 
many gross errors (and during the period of our research the U.N, 

published revised and corrected figures for the entire span we cover) 

* \ 

we have certainly not eliminated measurement error. And, even random 
measurement error biases OLS estimates of structural parameters, Elsewhere 
(Hannan, Rubinson, and Warren 1974) we discussed at length the likely 
patterns of random and no n random errors in these data, We proposed/ and ' 
estimated crudely some models with unobservable variables measured with 
8rr0r |f|? ultiple indlcators - Use of these models enables one to adjust 
estminates of structural" parameters for both random and specified nonrandom 
errors in" variables . Chapter 7 contains a more refined analysis of 

similar models using maximum likelihood procedures, 
j however, 

We have not, /used models with latent variables widely in our 

research. They demand the use of multiple indicators. For several of ' 

the variables we study, particularly political structure and economic ' 

dependence, measures are available for only small numbers' of nations 

on any one indicator, Using several indicators* at once reduces- the ..' 

effective sample size below the levels at which a multivariate analysis 

I 

can be trusted at all, This constraint is currently losing force as gaps 
in the data are being filled. As Chapter 7, the most recently completed of 
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■ the empirical papers, documents, sufficient data on political organiza- * 

. " - ^ K ^ - 

t ion are now available for the sensible use o.| unobservablc variables ^ ■. \ 

- ' v ' . 

models with- multiple indicators. We exp^^^ui^^uch' models, to . . \ 

±- . •* ■*■ ■; ' V A *- 'j * 

become increasingly important in future research*' on ■ thfese'^fisu^s . ,'ln 

the meanwhile, we take consolation in the findings in Chapter 7 and in* 

Hannan, Rub ins on, and Warren (1974) that many of the variables, we analyze are 

teasured quite reliably by sociological standards, ? 

d ' Functiona l Form of Relationships - Structural equation models are 

letimes criticized for limiting attention to linear-additive relation- 

\ ■ 

ships. Such a charge has no basis in logic' or fact, A structural 
equation model can have any functional form. Some of the models we 
estimate are linear in both variables and parameters, as in equations (1-2). 
Others are linear only in the parameters, as in regressions of logarithms 
of variables on lagged variables and other variables. The reader can 
establish that such relations are nonlinear by taking an ti logs of the ^ 
equation. Finally some of the models we estimate are nonlinear in the - 
parameters. For instance, in Chapters 4, 6, and 7 we estimate relation. - 
ships separately for rich and poor nations The functional forms used are ■ 
defended in the empirical chapters, 'This is not the place for any ex- 
tended discussion. Rather my point here is to emphasize that the- structural 
equation p^spective we'adopC doesfnot restrict us to any particular sub- 
stantive hypothesia , e.g., linear -additive relations. 



.f~ 



2 b 



Smrroary " , 

Most of our empirical work involves variations on a simple model. 
We treat each outcome as a function (sometimes linear) of a set of 
lagged variables, including the lagged dependent variable. This type 
of model entered the social science literature as an analogue to an 
experiment. But the model is not completely general and In no sense 
doe^s it render causal inference unproblematic . We- take the position 
that the model must be treated as a substantive model of the process 
and that the. quality of inferences from the model depends on the quality : 

of the substantive representation, I advanced three justifications for 

* . . . . . / 

the use of the simple model in our research. In the qourse of stating V 

' ; . • ' . \j 

thg.se arguments, 1 indicated possible difficulties with using this model 



to;study national social change* 

In large measure, these difficulties involve the strength of the ef- 
fects of changes in variables during the study period.. If the effects of changes 
are different in sign from the effects of initial levels, the basic model 
will tend to misstate causal effects. For reasons that I outlined, ,1 
do no A know and cannot determine empirically Whether this is a problem in 
our research In effect, we have proceeded under the assumption that the 
effects of initial levels of variables represent reasonably well the 
effects of all levels that occur during the" period". This, then, is. the 
major substantive assumption underlying our work. 

We reject the view that special estimation procedures must be 

• " . ' . & 

designed for jp an el studies. Instead s we treat the model as a structural 

fi « ;: e(^uat ion mod e 1 Sid use structural equation, procedures, Our work was 

%-f - 1 * 



conducted during a period in which sociological methodology underwent 
profound changes. In the early years of our research, the sociological 
literature did not offer useful guidelines on most estimation issues. 
However.' rapid progress has been made and sociologists have begun to 
utilize a wider variety of estimator*. ' The empirical chapters completed 

' * .% Tin : - * 1 

in different epochs of the project reflect 'some of these differences. 
In our earlier research we relied^avily on ordinary least squares 
estimators. As ou* research progressed , we altered procedures to, handle 
•certain complications . particularly autocorrelation, hiteroscedasticity 3 
*and measQt-ement error, in most cases the mora complex procedures had 
never been applied to studies of Rational social change, pur experience 
is that more refined analyses generally give the sam/quali native picture 
as did our earlier analyses (with certain exceptions-- see Chapter 7), 
but with a sharper focus, '* . i-*«rfave that it is important that future - 



research on national social change extend the use of these, and related; 
lures . ,. . * - 



* Footnotes 
"^Lazarsfeld 's work actually led to two traditions, His treatment of six- 
teen-fold tables (the full cross-classification of two diehotomous variables 
measure at two points in time) involved the line of thought discussed in 
the text. At the same tiro, hii treatment of the -so-calleArtight-fold 
table (the cross-classification of later measures on one Variable on the 
earlier values of both variables) was much closer to the position we take, 
2 • * 

For example. It is easy to develop simple situations in which such measures 

fall. For example s suppose X^ affects Y fc but Yq does not affec£ X . 

Suppose further that X is highly autocorrelated but Yq is not and that X Q 

and Y Q are reasonably highly correlated. Then the cross-lag correlation 

^OT€faure will lead to the inference that Y'causes X (the correlation of 

,/ ■■ • • ■■■ - 

Y Q with JC t will be larger than the correlation of X Q with Y^)". It is 

equally easy to come up with additional complications that have the same 

consequence for the cross-correlation test, Kenny (1973) has stated a set 

of conditions under which a generalization of cross- lag correlation pro- * 

cedure yields valid inferences, These restrictions are exceedingly * 

restrictive and appear to hold only for systems in equilibrium (among 

other things the contemporaneous correlations ffliong variables must be 

constant over time)* The i true tural-equat ion approach is suited to a wide 

range of situations beyond the scope of cross-lag correlation analysis"/ 

3 ' 

For reviews of this perspective on panel analysis, see Duncan (1968-69, 1972, 
1975), Goldberger (197 ), Hannan and Young (19^) si #aftd Heise * (1969 J* 



''*/: ■ - , ■ . f 



ERIC 



